Length | Sentence |
---|---|
255 | Ji̍at-lân-jia Siâⁿ Pau-ûi Chiàn( Hàn-jī : 熱蘭遮城包圍戰, Eng-gí : The Siege of Fort Zeelandia), ùi se-goân 1661 nî 5 ge̍h 30 hō khai-sí, kàu 1662 nî 2 ge̍h 1 hō ûi-chí, sī Tīⁿ Sêng-kong ê pō͘-tūi pau-ûi Ji̍at-lân-jia Siâⁿ Hô-lân lâng ê chi̍t-tiûⁿ ûi-siâⁿ chiàn. |
255 | Tông-ūi-sò͘ tī kho-ha̍k ê hō-miâ-hoat (nomenclature) sī ēng goân-sò͘ ê miâ-jī ke 1 ê liân-jī-hō, aū-piah koh chiap chit-liōng-hoan lâi piáu-sī, chhin-chhiūⁿ kóng He-3 (helium-3), C-12 (carbon-12), C-14 (carbon-14), I-131 (iodine-131), U-238 (uranium-238). |
254 | M̄-koh in-ūi hiān-chhú-sî jîn-lūi-ha̍k ê gián-kiù hong-hoat tōa-hūn sī ēng tī 1-ê 1-ê ê siā-hōe / thôan-thé gián-kiù téng-kôan, thâu-chêng kóng--ê hit-ê cheng-chha ê só·-chāi í-keng ū lú lâi lú chē lâng teh thó-lūn, mā siū-tio̍h lú lâi lú chē ê cheng-gī. |
254 | Keng-lāi ū hái-kiap sai-hōaⁿ tē-it ê Káu-liông-chià pho̍k-pò͘-kûn, ū "Tiong-hôa kî-koan" ê Lí-hî-khoe, nn̄g-ê kéng-tiám sī "Tiong-kok oân-bí ká-kî 10-ka lú-iû sòaⁿ-lō͘" ê tē-saⁿ-miâ, sī "Bân-tang-pak chhin-chúi-iû" sòaⁿ-lō͘ ê tiōng-iàu chó͘-sêng pō͘-hūn. |
254 | Siat-sú soh-á lâi khì hō͘-i thu̍t-khì (thu̍t-liāu), khì hō͘-i tn̄g-khì (tn̄g-liāu), gû nā-sī un-sûn lán koh tī gû-phīⁿ khan-lâi têng pa̍k tio̍h-hó, gû nā-sī iâu-koh siàu-liân (sin-gû-á), ah-sī iá-sèng tài-tāng, gû lân-tit chū-iû sī oē cháu hō͘-lâng jiok. |
254 | To̍k-sèng-ha̍k (毒性學) sī teh gián-kiù goā-lâi bu̍t-chit ia̍h-sī to̍k-sèng tùi seng-bu̍t-thé ê iú-hāi chok-iōng kap i-ê chok-iōng ki-chè ê kho-ha̍k ; koh ē-ēng chìn-chi̍t-pō͘ ī-chhek in tùi jîn-thé kap seng-thài khoân-kéng ê gûi-hāi ê giâm-tiōng thêng-tō͘. |
253 | Tī it-kok-lióng-chè ê hong-chiam tiong, pau-hâm Hiong-káng kap Ò-mn̂g lóng ē-sái pó-liû chu-pún-chú-gī keng-chè kap siā-hōe chèng-tī chè-tō͘, ah Tiong-hoâ Jîn-bîn Kiōng-hô-kok ê kî-thaⁿ só͘-chāi ē kè-sio̍k si̍t-hêng Tiong-kok-sek siā-hōe-chú-gī chè-tō͘. |
253 | Koè-liáu bêng-hoat ê sî-kî, koan-chhat ê tiōng-tiám sī ài khó-lū hōan-chiá ê kiān-khong chêng-hòng, ì-goān, kháu-khiuⁿ ōe-seng ê ûi-hō·, nā khak-tēng ū bâi-ho̍k ê chêng-hêng, tio̍h-ài múi 1 tang he̍k 2 tang chò tēng-kî kiám-cha, pau-koah tiān-kong-phìⁿ. |
253 | Tû-liáu Panasonic, in mā ū sú-iōng kòe chē khoán pâi-chú, kî-tiong chi̍t-ê sī National, ùi Tâi-oân thoân-thóng siōng hông kiò chò Kok-chè-pâi (國際牌), tong-tē chèng-sek teng-kì ê Hôa-gí miâ sī chhái-ēng kū Ji̍t-pún-miâ Hàn-jī ê Sông-hā Tiān-khì lâi hō--ê. |
252 | Ūi-tio̍h tō·-choa̍t thoân-jiám-pēⁿ ê hoat-seng, thoân-jiám, kap liû-hêng, Tâi-oân chèng-hú tī 2004 nî chiaⁿ-goe̍h 20-hō siu-tēng Thoân-jiám-pēⁿ Hông-tī-hoat, bêng-pe̍k kui-tēng ta̍k-lūi hoat-tēng thoân-jiám-pēⁿ ê hông-tī chèng-chhek kap īn-èng chhò-si. |
By default, sentence length is limited to 255 characters. We therefore usually see many sentences at or just below this maximal length of 255.
Such long sentences may also result from sub-optimal preprocessing, for example when two adjacent sentences were not split.
Please note that the limit counts Unicode characters, not bytes: a sentence at the maximal length may occupy considerably more bytes than it has characters.
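The difference between character count and byte count can be checked directly. The following is a minimal sketch, assuming the text is stored as UTF-8 (the encoding is not stated in this report); the helper name `char_and_byte_length` is hypothetical and only serves the illustration. The sample string is the name "Tīⁿ Sêng-kong" from the table above, written with escape sequences so that the code points are unambiguous.

```python
from typing import Tuple


def char_and_byte_length(text: str, encoding: str = "utf-8") -> Tuple[int, int]:
    """Return (number of Unicode code points, number of encoded bytes).

    Hypothetical helper; UTF-8 is an assumption, not a documented
    property of the corpus files.
    """
    return len(text), len(text.encode(encoding))


# "Tīⁿ Sêng-kong" in precomposed form:
# U+012B (i with macron), U+207F (superscript n), U+00EA (e with circumflex).
sample = "T\u012b\u207f S\u00eang-kong"

chars, nbytes = char_and_byte_length(sample)
print(chars, nbytes)  # 13 characters, 17 bytes in UTF-8
```

Each non-ASCII letter here takes two or three bytes in UTF-8, so a sentence that fits the character limit can still exceed the same number of bytes. Text that uses combining diacritics instead of precomposed letters would give a higher character count as well.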
4.1.1 Shortest sentences
4.1.2 Sentences of fixed length I
4.1.3 Sentences of fixed length II
4.1.4 Sentences of fixed length III